Phonetic Modelling in the Philips Chinese Continuous Speech Recognition System

نویسندگان

  • Frank Seide
  • Nick J C Wang
چکیده

We have extended the Philips large vocabulary continuous speech recognition system towards Chinese On the way from our existing Western language technology to Mandarin the rst step was to build a suitable phonetic model This paper describes the development of our phonetic model excluding tones for Mandarin Chinese We will present a systematic comparison of three forms of sub syllabic units for Chinese phonemes initials nals and a non tonal form of preme toneme models as well as whole syllable models for reference We include experiments on bottom up and decision tree based top down state clustering and modelling of cross syllable contexts All forms of sub syllabic units are represented in the Philips Mandarin phone set SAMPA C SAMPA C is based on the European SAMPA standard and introduced in this paper Our studies show that traditional half syllable approaches slightly outperform Western style triphones Modelling of right context dependency gives greater improvement than left context dependency and cross syllable modelling yields a performance gain In a free syllable decoding task we achieve syllable error rate for telephone speech and for microphone dictations

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Super Phonetic System and Multi-dialect Chinese Speech Corpus for Speech Recognition

In this paper, we describe the work on Chinese multi-dialect speech processing. Based on the phonetic analysis of ten Chinese dialects, we have created a Chinese super phonetic system for the Chinese speech recognition. To exam this phonetic system and develop Chinese dialect speech technology, we are building a multi-dialect speech corpus, which includes 10 dialect areas and 2000 speakers.

متن کامل

Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese

We study the problem of phonetic modeling for continuous Mandarin speech recognition by providing a systematic performance comparison for systems based on following primitive speech units: syllable, demi-syllable (Initials and Finals), context-independent phones, left-or-right context-dependentphones (diphones), and leftand-right context-dependent phones (triphones). In our speakerdependent con...

متن کامل

Development of the philips 1999 taiwan Mandarin benchmark system

This paper describes the Philips Large Vocabulary Continuous Mandarin speech recognition system for the 1999 Taiwan benchmark. The basic system architecture is based on the Philips LVCSR technology developed for Western languages. However, several modifications are made in order to better suitted processing Chinese spoken languages. In the paper, we present some experimental results on the two ...

متن کامل

First Experiments on an Hmm Based Double Layer Framework for Automatic Continuous Speech Recognition

The usual approach to automatic continuous speech recognition is what can be called the acoustic-phonetic modelling approach. In this approach, voice is considered to hold two different kinds of information—acoustic and phonetic—. Acoustic information is represented by some kind of feature extraction out of the voice signal, and phonetic information is extracted from the vocabulary of the task ...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998